A cost-sensitive classification algorithm: BEE-Miner

نویسندگان

  • Pinar Tapkan
  • Lale Özbakir
  • Sinem Kulluk
  • Adil Baykasoglu
چکیده

Classification is a data mining technique which is utilized to predict the future by using available data and aims to discover hidden relationships between variables and classes. Since the cost component is crucial in most real life classification problems and most traditional classification methods work for the purpose of correct classification, developing cost-sensitive classifiers which minimize the total misclassification cost remains a subject of much interest. The purpose of this study is to present an effective solution method that configurates and evaluates learning systems from previous experiences, thus aiming to obtain decisions and predictions. Since most real life problems are cost-sensitive and developing effective direct methods for cost-sensitive multi-class classification is still an attractive area, a cost-sensitive classification method, the BEE-Miner algorithm, is proposed by utilizing the recently developed Bees Algorithm (BA). The main advantages of BEE-Miner are its capability to handle both binary and multi-class problems and to incorporate misclassification cost into the algorithm via generating neighbor solutions and evaluating the quality of the solutions. An extensive computational study is also performed on cost-insensitive and cost-sensitive versions of the proposed BEE-Miner algorithm and effective results on different types of problems are obtained with high test accuracy and low misclassification cost. © 2015 Elsevier B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Proposing a Novel Cost Sensitive Imbalanced Classification Method based on Hybrid of New Fuzzy Cost Assigning Approaches, Fuzzy Clustering and Evolutionary Algorithms

In this paper, a new hybrid methodology is introduced to design a cost-sensitive fuzzy rule-based classification system. A novel cost metric is proposed based on the combination of three different concepts: Entropy, Gini index and DKM criterion. In order to calculate the effective cost of patterns, a hybrid of fuzzy c-means clustering and particle swarm optimization algorithm is utilized. This ...

متن کامل

A New Formulation for Cost-Sensitive Two Group Support Vector Machine with Multiple Error Rate

Support vector machine (SVM) is a popular classification technique which classifies data using a max-margin separator hyperplane. The normal vector and bias of the mentioned hyperplane is determined by solving a quadratic model implies that SVM training confronts by an optimization problem. Among of the extensions of SVM, cost-sensitive scheme refers to a model with multiple costs which conside...

متن کامل

FUZZY GRAVITATIONAL SEARCH ALGORITHM AN APPROACH FOR DATA MINING

The concept of intelligently controlling the search process of gravitational search algorithm (GSA) is introduced to develop a novel data mining technique. The proposed method is called fuzzy GSA miner (FGSA-miner). At first a fuzzy controller is designed for adaptively controlling the gravitational coefficient and the number of effective objects, as two important parameters which play major ro...

متن کامل

Privacy-Preserving Classification of Customer Data without Loss of Accuracy

Privacy has become an increasingly important issue in data mining. In this paper, we consider a scenario in which a data miner surveys a large number of customers to learn classification rules on their data, while the sensitive attributes of these customers need to be protected. Solutions have been proposed to address this problem using randomization techniques. Such solutions exhibit a tradeof...

متن کامل

A Honey Bee Algorithm To Solve Quadratic Assignment Problem

Assigning facilities to locations is one of the important problems, which significantly is influence in transportation cost reduction. In this study, we solve quadratic assignment problem (QAP), using a meta-heuristic algorithm with deterministic tasks and equality in facilities and location number. It should be noted that any facility must be assign to only one location. In this paper, first o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Knowl.-Based Syst.

دوره 95  شماره 

صفحات  -

تاریخ انتشار 2016